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Abstract 

The Chi-square statistic is a non-parametric (distribution free) tool designed to analyze group differences when the dependent variable is measured 
at a nominal level. Like all non-parametric statistics, the Chi-square is robust with respect to the distribution of the data. Specifically, it does not 
require equality of variances among the study groups or homoscedasticity in the data. It permits evaluation of both dichotomous independent va- 
riables, and of multiple group studies. Unlike many other non-parametric and some parametric statistics, the calculations needed to compute the 
Chi-square provide considerable information about how each of the groups performed in the study. This richness of detail allows the researcher to 
understand the results and thus to derive more detailed information from this statistic than from many others. 

The Chi-square is a significance statistic, and should be followed with a strength statistic. The Cramer's V is the most common strength test used 
to test the data when a significant Chi-square result has been obtained. Advantages of the Chi-square include its robustness with respect to dis- 
tribution of the data, its ease of computation, the detailed information that can be derived from the test, its use in studies for which parametric 
assumptions cannot be met, and its flexibility in handling data from both two group and multiple group studies. Limitations include its sample size 
requirements, difficulty of interpretation when there are large numbers of categories (20 or more) in the independent or dependent variables, and 
tendency of the Cramer's V to produce relative low correlation measures, even for highly significant results. 
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Introduction 

The Chi-square test of independence (also known 
as the Pearson Chi-square test, or simply the Chi- 
square) is one of the most useful statistics for test- 
ing hypotheses when the variables are nominal, as 
often happens in clinical research. Unlike most sta- 
tistics, the Chi-square (x 2 ) can provide information 
not only on the significance of any observed dif- 
ferences, but also provides detailed information 
on exactly which categories account for any differ- 
ences found. Thus, the amount and detail of infor- 
mation this statistic can provide renders it one of 
the most useful tools in the researcher's array of 
available analysis tools. As with any statistic, there 
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are requirements for its appropriate use, which are 
called "assumptions" of the statistic. Additionally, 
the x 2 is a significance test, and should always be 
coupled with an appropriate test of strength. 

The Chi-square test is a non-parametric statistic, 
also called a distribution free test. Non-parametric 
tests should be used when any one of the follow- 
ing conditions pertains to the data: 

1 . The level of measurement of all the variables is 
nominal or ordinal. 

2. The sample sizes of the study groups are un- 
equal; for the x 2 the groups may be of equal 
size or unequal size whereas some parametric 
tests require groups of equal or approximately 
equal size. 

3. The original data were measured at an interval 
or ratio level, but violate one of the following 
assumptions of a parametric test: 
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a) The distribution of the data was seriously 
skewed or kurtotic (parametric tests assume 
approximately normal distribution of the de- 
pendent variable), and thus the researcher 
must use a distribution free statistic rather than 
a parametric statistic. 

b) The data violate the assumptions of equal vari- 
ance or homoscedasticity. 

c) For any of a number of reasons (1), the continu- 
ous data were collapsed into a small number of 
categories, and thus the data are no longer in- 
terval or ratio. 

Assumptions of the Chi-square 

As with parametric tests, the non-parametric tests, 
including the x 2 assume the data were obtained 
through random selection. However, it is not un- 
common to find inferential statistics used when 
data are from convenience samples rather than 
random samples. (To have confidence in the re- 
sults when the random sampling assumption is vi- 
olated, several replication studies should be per- 
formed with essentially the same result obtained). 
Each non-parametric test has its own specific as- 
sumptions as well. The assumptions of the Chi- 
square include: 

1) The data in the cells should be frequencies, 
or counts of cases rather than percentages or 
some other transformation of the data. 

2) The levels (or categories) of the variables are 
mutually exclusive. That is, a particular subject 
fits into one and only one level of each of the 
variables. 

3) Each subject may contribute data to one and 
only one cell in the x 2 . If, for example, the same 
subjects are tested over time such that the 
comparisons are of the same subjects at Time 1, 
Time 2, Time 3, etc., then x 2 may not be used. 

4) The study groups must be independent. This 
means that a different test must be used if the 
two groups are related. For example, a differ- 
ent test must be used if the researcher's data 
consists of paired samples, such as in studies in 
which a parent is paired with his or her child. 
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5) There are 2 variables, and both are measured 
as categories, usually at the nominal level. How- 
ever, data may be ordinal data. Interval or ratio 
data that have been collapsed into ordinal cat- 
egories may also be used. While Chi-square has 
no rule about limiting the number of cells (by 
limiting the number of categories for each vari- 
able), a very large number of cells (over 20) can 
make it difficult to meet assumption #6 below, 
and to interpret the meaning of the results. 

6) The value of the cell expecteds should be 5 or 
more in at least 80% of the cells, and no cell 
should have an expected of less than one (3). 
This assumption is most likely to be met if the 
sample size equals at least the number of cells 
multiplied by 5. Essentially, this assumption 
specifies the number of cases (sample size) 
needed to use the x 2 for any number of cells in 
that x 2 - This requirement will be fully explained 
in the example of the calculation of the statistic 
in the case study example. 

Case study 

To illustrate the calculation and interpretation of 
the x 2 statistic, the following case example will be 
used: 

The owner of a laboratory wants to keep sick leave 
as low as possible by keeping employees healthy 
through disease prevention programs. Many em- 
ployees have contracted pneumonia leading to 
productivity problems due to sick leave from the 
disease. There is a vaccine for pneumococcal pneu- 
monia, and the owner believes that it is important 
to get as many employees vaccinated as possible. 
Due to a production problem at the company that 
produces the vaccine, there is only enough vac- 
cine for half the employees. In effect, there are two 
groups; employees who received the vaccine and 
employees who did not receive the vaccine. The 
company sent a nurse to every employee who 
contracted pneumonia to provide home health 
care and to take a sputum sample for culture to 
determine the causative agent. They kept track of 
the number of employees who contracted pneu- 
monia and which type of pneumonia each had. 
The data were organized as follows: 
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• Group 1: Not provided with the vaccine (unvac- 
cinated control group, N = 92) 

• Group 2: Provided with the vaccine (vaccinated 
experimental group, N = 92) 

In this case, the independent variable is vaccina- 
tion status (vaccinated versus unvaccinated). The 
dependent variable is health outcome with three 
levels: 

• contracted pneumoccal pneumonia; 

• contracted another type of pneumonia; and 

• did not contract pneumonia. 

The company wanted to know if providing the 
vaccine made a difference. To answer this ques- 
tion, they must choose a statistic that can test for 
differences when all the variables are nominal. The 
X 2 statistic was used to test the question, "Was 
there a difference in incidence of pneumonia be- 
tween the two groups?" At the end of the winter, 
Table 1 was constructed to illustrate the occur- 
rence of pneumonia among the employees. 



Table 1. Results of the vaccination program. 



Health Outcome 


Unvaccinated 


Vaccinated 


Sick with pneumococcal 
pneumonia 


23 


5 


Sick with non-pneumococcal 
pneumonia 


8 


10 


No pneumonia 


61 


77 



Calculating Chi-square 

With the data in table form, the researcher can 
proceed with calculating the x 2 statistic to find out 



if the vaccination program made any difference in 
the health outcomes of the employees. The for- 
mula for calculating a Chi-Square is: 



(0-E)< 



Where: 

O = Observed (the actual count of cases in each 
cell of the table) 

E = Expected value (calculated below) 
X 2 = The cell Chi-square value 

X* 2 = Formula instruction to sum all the cell Chi- 
square values 

Xf_j = i-j is the correct notation to represent all the 
cells, from the first cell (/) to the last cell (/); in this 
case Cell 1 (/') through Cell 6 (j). 

The first step in calculating a x 2 is to calculate the 
sum of each row, and the sum of each column. 
These sums are called the "marginals" and there 
are row marginal values and column marginal val- 
ues. The marginal values for the case study data 
are presented in Table 2. 

The second step is to calculate the expected values 
for each cell. In the Chi-square statistic, the "ex- 
pected" values represent an estimate of how the 
cases would be distributed if there were NO vac- 
cine effect. Expected values must reflect both the 
incidence of cases in each category and the unbi- 
ased distribution of cases if there is no vaccine ef- 
fect. This means the statistic cannot just count the 
total N and divide by 6 for the expected number in 
each cell. That would not take account of the fact 
that more subjects stayed healthy regardless of 



Table 2. Calculation of marginals. 



Health Outcome 


Not vaccinated 
Coll 


Vaccinated 
Col 2 


Row marginals 
(Row sum) 


Sick with pneumococcal pneumonia 


23 


5 


28 


Sick with non-pneumococcal pneumonia 


8 


10 


18 


Stayed healthy 


61 


77 


138 


Column marginals (Sum of the column) 


92 


92 


N = 184 
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whether they were vaccinated or not. Chi-Square 
expecteds are calculated as follows: 

_ M R x M c 
n 

Where: 

E = represents the cell expected value, 

M R = represents the row marginal for that cell, 

M c = represents the column marginal for that cell, 
and 

n = represents the total sample size. 

Specifically, for each cell, its row marginal is multi- 
plied by its column marginal, and that product is 
divided by the sample size. For Cell 1, the math is 
as follows: (28 x 92)/1 84 = 1 3.92. Table 3 provides the 
results of this calculation for each cell. Once the ex- 
pected values have been calculated, the cell x 2 val- 
ues are calculated with the following formula: 

2 (O-E) 2 
* =_ E- 

The cell x 2 for the first cell in the case study data is 
calculated as follows: (23-1 3.93) 2 /1 3.93 = 5.92. The 
cell x 2 value for each cellis the value in parentheses 
in each of the cells in Table 3. 

Once the cell x 2 values have been calculated, they 
are summed to obtain the x 2 statistic for the table. 
In this case, the x 2 is 12.35 (rounded). The Chi- 
square table requires the table's degrees of free- 
dom (df) in order to determine the significance 
level of the statistic. The degrees of freedom for a 
X 2 table are calculated with the formula: 

(Number of rows -1) x (Number of columns -1). 

For example, a 2 x 2 table has 1 df. (2-1) x (2-1) = 1. 
A 3 x 3 table has (3-1) x (3-1) = 4 df. A 4 x 5 table has 

Table 3. Cell expected values and (cell Chi-square values). 

Health outcome 



(4-1) x (5-1) = 3 x 4 = 12 df. Assuming a x 2 value of 
12.35 with each of these different df levels (1, 4, 
and 12), the significance levels from a table of x 2 
values, the significance levels are: df = 1, P < 0.001, 
df = 4, P < 0.025, and df = 12, P > 0.1 0. Note, as de- 
grees of freedom increase, the P-level becomes 
less significant, until the x 2 value of 12.35 is no 
longer statistically significant at the 0.05 level, be- 
cause P was greater than 0.1 0. 

For the sample table with 3 rows and 2 columns, 
df = (3-1) x (2-1) = 2 x 1 = 2. A Chi-square table of 
significances is available in many elementary statis- 
tics texts and on many Internet sites. Using a x 2 ta- 
ble, the significance of a Chi-square value of 12.35 
with 2 df equals P < 0.005. This value may be round- 
ed to P < 0.01 for convenience. The exact signifi- 
cance when the Chi-square is calculated through a 
statistical program is found to be P = 0.0011. 

As the P-value of the table is less than P < 0.05, the 
researcher rejects the null hypothesis and accepts 
the alternate hypothesis: "There is a difference in 
occurrence of pneumococcal pneumonia between 
the vaccinated and unvaccinated groups." Howev- 
er, this result does not specify what that difference 
might be. To fully interpret the result, it is useful to 
look at the cell x 2 values. 

Interpreting cell x 2 values 

It can be seen in Table 3 that the largest cell x 2 val- 
ue of 5.92 occurs in Cell 1. This is a result of the ob- 
served value being 23 while only 13.92 were ex- 
pected. Therefore, this cell has a much larger 
number of observed cases than would be expect- 
ed by chance. Cell 1 reflects the number of unvac- 
cinated employees who contracted pneumococcal 
pneumonia. This means that the number of unvac- 
cinated people who contracted pneumococcal 
pneumonia was significantly greater than expect- 
ed. The second largest cell x 2 value of 4.56 is locat- 



Not vaccinated Vaccinated 



Sick with pneumococcal pneumonia 13.92(5.92) 12.57(4.56) 

Sick with non-pneumococcal pneumonia 8.95(0.10) 9.05(0.10) 

Stayed healthy 69.12 (0.95) 69.88 (0.73) 
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ed in Cell 2. However, in this cell we discover that 
the number of observed cases was much lower 
than expected (Observed = 5, Expected = 12.57). 
This means that a significantly lower number of 
vaccinated subjects contracted pneumococcal 
pneumonia than would be expected if the vaccine 
had no effect. No other cell has a cell x 2 value 
greater than 0.99. 

A cell x 2 value less than 1.0 should be interpreted 
as the number of observed cases being approxi- 
mately equal to the number of expected cases, 
meaning there is no vaccination effect on any of 
the other cells. In the case study example, all other 
cells produced cell x 2 values below 1.0. Therefore 
the company can conclude that there was no dif- 
ference between the two groups for incidence of 
non-pneumococcal pneumonia. It can be seen 
that for both groups, the majority of employees 
stayed healthy. The meaningful result was that 
there were significantly fewer cases of pneumo- 
coccal pneumonia among the vaccinated employ- 
ees and significantly more cases among the unvac- 
cinated employees. As a result, the company 
should conclude that the vaccination program did 
reduce the incidence of pneumoccal pneumonia. 

Very few statistical programs provide tables of cell 
expecteds and cell x 2 values as part of the default 
output. Some programs will produce those tables 
as an option, and that option should be used to ex- 
amine the cell x 2 values. If the program provides an 
option to print out only the cell x 2 value (but not cell 
expecteds), the direction of the x 2 value provides in- 
formation. A positive cell x 2 value means that the 
observed value is higher than the expected value, 
and a negative cell x 2 value (e.g. -12.45) means the 
observed cases are less than the expected number 
of cases. When the program does not provide either 
option, all the researcher can conclude is this: The 
overall table provides evidence that the two groups 
are independent (significantly different because P < 
0.05), or are not independent (P > 0.05). Most re- 
searchers inspect the table to estimate which cells 
are overrepresented with a large number of cases 
versus those which have a small number of cases. 
However, without access to cell expecteds or cell 
X 2 values, the interpretation of the direction of the 
group differences is less precise. Given the ease of 
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calculating the cell expecteds and x 2 values, re- 
searchers may want to hand calculate those values 
to enhance interpretation. 

Chi-square and closely related tests 

One might ask if, in this case, the Chi-square was 
the best or only test the researcher could have 
used. Nominal variables require the use of non- 
parametric tests, and there are three commonly 
used significance tests that can be used for this 
type of nominal data. The first and most common- 
ly used is the Chi-square. The second is the Fisher's 
exact test, which is a bit more precise than the Chi- 
square, but it is used only for 2 x 2 Tables (4). For 
example, if the only options in the case study were 
pneumonia versus no pneumonia, the table would 
have 2 rows and 2 columns and the correct test 
would be the Fisher's exact. The case study exam- 
ple requires a 2 x 3 table and thus the data are not 
suitable for the Fisher's exact test. 

The third test is the maximum likelihood ratio Chi- 
square test which is most often used when the 
data set is too small to meet the sample size as- 
sumption of the Chi-square test. As exhibited by 
the table of expected values for the case study, the 
cell expected requirements of the Chi-square were 
met by the data in the example. Specifically, there 
are 6 cells in the table. To meet the requirement 
that 80% of the cells have expected values of 5 or 
more, this table must have 6 x 0.8 = 4.8 rounded to 
5. This table meets the requirement that at least 5 
of the 6 cells must have cell expected of 5 or more, 
and so there is no need to use the maximum likeli- 
hood ratio chi-square. Suppose the sample size 
were much smaller. Suppose the sample size was 
smaller and the table had the data in Table 4. 



Table 4. Example of a table that violates cell expected values. 



Health outcome 


Not Vaccinated Vaccinated 


Pneumococcal Pneumonia 


4(2.22)/1.42 0(1.75)/1.78 


Non-pneumococcal 
Pneumonia 


2(1.67)/0.07 1 (1.33)/0.08 


Stayed healthy 


14(1 6.1 U/0.28 15 (12.89J/0.35 


Sample raw data presented first, 


sample expected values in 


parentheses, and cell follow the slash. 
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Although the total sample size of 39 exceeds the 
value of 5 cases x 6 cells = 30, the very low distri- 
bution of cases in 4 of the cells is of concern. When 
the cell expecteds are calculated, it can be seen 
that 4 of the 6 cells have expecteds below 5, and 
thus this table violates the x 2 test assumption. This 
table should be tested with a maximum likelihood 
ratio Chi-square test. 

When researchers use the Chi-square test in viola- 
tion of one or more assumptions, the result may or 
may not be reliable. In this author's experience of 
having output from both the appropriate and in- 
appropriate tests on the same data, one of three 
outcomes are possible: 

First, the appropriate and the inappropriate test 
may give the same results. 

Second, the appropriate test may produce a signif- 
icant result while the inappropriate test provides a 
result that is not statistically significant, which is a 
Type II error. 

Third, the appropriate test may provide a non-sig- 
nificant result while the inappropriate test may 
provide a significant result, which is a Type I error. 

Strength test for the Chi-square 

The researcher's work is not quite done yet. Find- 
ing a significant difference merely means that the 
differences between the vaccinated and unvacci- 
nated groups have less than 1.1 in a thousand 
chances of being in error (P = 0.0011). That is, there 
are 1 .1 in one thousand chances that there really is 
no difference between the two groups for con- 
tracting pneumococcal pneumonia, and that the 
researcher made a Type I error. That is a sufficiently 
remote probability of error that in this case, the 
company can be confident that the vaccination 
made a difference. While useful, this is not com- 
plete information. It is necessary to know the 
strength of the association as well as the signifi- 
cance. 

Statistical significance does not necessarily imply 
clinical importance. Clinical significance is usually 
a function of how much improvement is produced 
by the treatment. For example, if there was a sig- 



nificant difference, but the vaccine only reduced 
pneumonias by two cases, it might not be worth 
the company's money to vaccinate 184 people (at 
a cost of $20 per person) to eliminate only two cas- 
es. In this case study, the vaccinated group experi- 
enced only 5 cases out of 92 employees (a rate of 
5%) while the unvaccinated group experienced 23 
cases out of 92 employees (a rate of 25%). While it 
is always a matter of judgment as to whether the 
results are worth the investment, many employers 
would view 25% of their workforce becoming ill 
with a preventable infectious illness as an undesir- 
able outcome. There is, however, a more standard- 
ized strength test for the Chi-Square. 

Statistical strength tests are correlation measures. 
For the Chi-square, the most commonly used 
strength test is the Cramer's V test. It is easily cal- 
culated with the following formula: 




Where n is the number of rows or number of col- 
umns, whichever is less. For the example, the V is 
0.259 or rounded, 0.26 as calculated below. 

The Cramer's V is a form of a correlation and is in- 
terpreted exactly the same. For any correlation, a 
value of 0.26 is a weak correlation. It should be 
noted that a relatively weak correlation is all that 
can be expected when a phenomena is only par- 
tially dependent on the independent variable. 

In the case study, five vaccinated people did con- 
tract pneumococcal pneumonia, but vaccinated 
or not, the majority of employees remained 
healthy. Clearly, most employees will not get pneu- 
monia. This fact alone makes it difficult to obtain a 
moderate or high correlation coefficient. The 
amount of change the treatment (vaccine) can 
produce is limited by the relatively low rate of dis- 
ease in the population of employees. While the 
correlation value is low, it is statistically significant, 
and the clinical importance of reducing a rate of 
25% incidence to 5% incidence of the disease 
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would appear to be clinically worthwhile. These 
are the factors the researcher should take into ac- 
count when interpreting this statistical result. 

Summary and conclusions 

The Chi-square is a valuable analysis tool that pro- 
vides considerable information about the nature 
of research data. It is a powerful statistic that ena- 
bles researchers to test hypotheses about varia- 
bles measured at the nominal level. As with all in- 
ferential statistics, the results are most reliable 
when the data are collected from randomly select- 
ed subjects, and when sample sizes are sufficiently 
large that they produce appropriate statistical 
power. The Chi-square is also an excellent tool to 
use when violations of assumptions of equal vari- 
ances and homoscedascity are violated and para- 
metric statistics such as the t-test and ANOVA can- 
not provide reliable results. As the Chi-Square and 
its strength test, the Cramer's V are both simple to 
compute, it is an especially convenient tool for re- 



searchers in the field where statistical programs 
may not be easily accessed. However, most statisti- 
cal programs provide not only the Chi-square and 
Cramer's V, but also a variety of other non-para- 
metric tools for both significance and strength 
testing. 
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